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FOREWORD 


Phase lA of LACIE was a period of operation that existed 
before the first classification system meeting the LACIE 
requirements was delivered and prior to the development of 
validated CAMS procedures. The purpose of this report Is 
to provide an Initial Indication of the early LACIE system 
design performance In a few critical areas. 

During the November/December 1974 operation, the Landsat I 
multlspectral scanner data were retrospectively ordered for 
an area in central and southwestern Kansas. With respect to 
the blostages utilized In LACIE Phase I, segments per bio- 
stage were ordered as follows: blostage lA, 3 segments; bio- 

stage IB, 6 segments; blostage II, 5 segments; blostage III, 

3 segments; and blostage IV, 11 segments. 

In five Intensive test sites, multi temporal data were re- 
ceived, which allowed the accuracy of classification to be 
evaluated for each blostage. The data received from the 
LACIE Phase lA LACIE Operation Report were: for sites 1106 

and 1034, olostages I, II, III, and IV^ were received; for 

^During LACIE Phase lA, blostages were defined as: 

I - Sowing to jointing 
II - jointing to soft dough 
III - soft dough to harvest 
IV - harvest to 36 days after harvest 
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site 1114, biostages II and IV were received; for site 1042, 
biostages I, III, and IV were received; and for site 1111, 
biostages III and IV were received* 

The accuracy assessment activity involved comparisons of auto- 
matic data processing classification, analyst interpreter 
findings, and acreage estimation results to other available 
information, such as ground- truth data from SRS enumerative 
sites and the intensive test sites. These comparisons were 
also made to the official Statistical Reporting Service 
estimates for Kansas for the 1973-1974 crop year, and statis- 
tical analysis was performed on the data thus obtained. 
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SECTION 1.0 


SUMMARY 

The results of the accuracy assessment activity for Phase lA 

of LAC IE indicated that 

1. the 90/90 criteria could be reached if the degree of 
accuracy of the LACIE performance^ in Kansas could be 
equaled in other areas. 

2. the classification of both wheat and nonwheat fields is 
significantly accurate for the three ITS segments ana- 
lyzed in detail. The wheat field classification accuracy 
varied significantly for the segments. However, this was 
not so with respect to nonwheat fields. 

3. biophase as well as its interaction with segment location 
turned out to be a significant factor for the classifi- 
cation performance. Analyst interpretation of segments , 
for training the classifier was a signifi'"ant error- 
contributing factor in the estimation of wheat acreage 

at both the field and the segment levels. 

^Only 28 of the 84 Kansas segments were processed, and these 
were limited to six crop reporting districts, only two of 
which had most of the segments that were allocated. There- 
fore, this was a limited test over a major wheat-producing 
region, and the accuracy of extrapolating these results to 
the entire central plains, including more marginal areas, 
would be questionable. 
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4. low percentages o£ wheat were overestimated and that high 
percentages of wheat were underestimated. (Regression 
showed a correlation coefficient, r, of .52 between ac- 
tual and estimated percentages of wheat over five Kansas 
intensive test sites. 

5. classification variances were approximately 3.6 times 
larger than the sample variance. 
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SECTION 2.0 


lACIE ACCURACY ASSESSMENT - NOVEMBER/DECEMBER OPERATIONS 


2.1 CAS AGGREGATION 

In a CRD that is composed of only Group I and Group III type 
counties, the acreage estimate is computed using the following 
formula: 


A 


Y = 






(2-1) 


where 

th 

p = Computed LACIE estimate of wheat proportion in k 
county. 

A = Total acreage in county reported in 1969. 

iv 

W “ Total wheat acreage during 1969 in m counties for 
whxch LACIE estimates are computed. 

W = Total wheat acreage during 1969 in the remaining 
o 

counties, say (n - m) , for which no LACIE estimates 
are computed. 


The variance of Y is given by 



( 2 - 2 ) 
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2 ^ 
where denotes the variance of estimate Pj^. (Observe 

A 2 

that if is a biased estimate ^ then Sj^ should be con** 

A 

sidered the mean square error (MSB) for Py^, and thereby one 

A 

can obtain the MSB instead of the variance of Y by comput** 
2 

ing S .) In present computations we shall assume that P^^ 
is unbiased for all counties. 


Next, for the collection of all six CRD's, a wheat acreage 
estimate and its variance is obtained by 


A 6 A 

X » r Y. (2-3) 

i=l ^ 


and 


2 2 

0^ » Z S/ (2-4) 

i»l ^ 


where 


A A 

Y^ = LACIE estimate for i CRD. 


2 

* Variance of Y^. 


To compute the variance in equation (2-2 ) , and thereafter in 

2 

equation (2-4 ) , the variances within counties, Sj^ , 
k = 1, 2, . . . , m, need to be computed. Since a single seg- 

A 

ment is used in obtaining Pj^ for the kth county. 
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2 

k > 1, 2, m, no direct estimate of Sj^ can be made. 

2 

However r an estimate of Sj^ , k « 2, . . . , m, can be achieved 

in the following indirect way. 

From the 28 segments processed (table 2-1), the average wheat pro- 
portion computed is 0.326, and an estimate of the variance between 
segments is 0.01488. On the other hand, using the 1969 census 
data for the 28 counties for which the segments were processed, 
the variance between county wheat proportions is 0.00489. Now, 
considering that the variance between segments is composed of 
the variance within counties and the variance between counties, 
the variance within counties, a measure of 8^unpling error, is 
obtained by the difference (0.01488 - 0.00489 « 0.00999). 

However, if necessary, as is the case presently, it needs to 
be corrected with respect to the bias associated in estimating 
the average wheat proportion of all 28 counties. An estimate 

of this bias is 0.326 - 0.30 0.026, where 0.30 is the SRS 

2 

estimate *.or 1973-74. Accordingly, an estimate cf Sj^ 

is 0.00999 - 0.026^ ^ 0.0093. Now, replacing by 0.0093 

2 . . 

in equation (2-2), an estimate of S is obtained. Similarly, 

2 

one obtains an estimate of o in equation (2-4) . 

Presented in table 2-II are each CRD wheat acreage estimate 
and an estimate of its standard deviation, coefficient of 
variation (CV) , bias, and the confidence level for the bias 
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to be within plus or minus 10 percent of the actual wheat 
acreage. Bias is calculated in terms of the difference between 
a LACI£ estimate and its corresponding SRS estimate for the 
1973-1974 crop year. The formula for computing the confidence 
level a is given oy 


0 ■ ♦ 


/O.l - k\ A !o,l + k\ 


(2-5) 


where cv = — 


s « standard deviation 
B - E(Y) - Y 

E(Y) » expected acreage estimate 
Y * true acreage 


K = ^ 


X z 

1 r "“T 

♦ (X) E e dz 

J 
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TABLE 2-1.- SAMPLE SEGMENTS FOR NOVEMBER SYSTEM 


State or CRD County or Site Site Aggregated 


region 


site neune 

coordinates 

number 


Kansas 

1 

Graham 

N39:29 

- 

W099:55 

1018 

X 

Kansas 

2 

Osborne 

N39:32 

- 

W099:00 

1101 

X 

Kansas 

2 

Ottawa 

N39;ll 

- 

W097:35 

1102 

X 

Kansas 

2 

Rooks 

N39;ll 

- 

W099:07 

1103 

X 

Kansas 

4 

Lane 

N38;25 

- 

W100:26 

1026 


Kansas 

4 

Ness 

N38:36 

- 

W099:47 

1028 

X 

Kansas 

4 

Scott 

N38:28 

- 

W100:46 

1029 

X 

Kansas 

4 

Trego 

N39:02 

- 

W099:49 

1030 


Kansas 

5 

Barton 

N38:37 

- 

W098:50 

1104 

X 

Kansas 

5 

Dickinson 

N33:51 

- 

W099:ll 

1105 


Kansas 

5 

Ellis^ 

N33;50 

- 

W099:13 

1106 

X 

Kansas 

5 

Ellsworth 

N38:37 

- 

W098:13 

1107 

X 

Kansas 

5 

Lincoln 

N39:09 

- 

W098:14 

1108 

X 

Kansas 

5 

Marion 

N38;32 

- 

W097:08 

1109 

X 

Kansas 

5 

McPherson 

N33:22 

- 

W097:23 

1110 

X 

Kansas 

5 

Rice^ 

N33:17 

- 

W098;13 

1111 

X 

Kansas 

5 

Rush 

N38:33 

- 

W093:05 

1112 


Kansas 

5 

Russell 

N39:05 

- 

W093!50 

1113 


Kansas 

5 

Saline^ 

N33:52 

- 

W097:28 

1114 

X 

Kansas 

7 

Hodgeman 

N33:12 

- 

W098:41 

1039 


Kansas 

7 

Kearny 

N37:56 

- 

W101:17 

1040 

X 


Intensive Test Sites 
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TABLE 2-1.- Concluded 


State or 
region 

CRD 

County or 
site name 

Site 

coordinates 

Site 

number 

Aggregated 

Kansas 

7 

Morton^ 

N37:16 

- 

W101:54 

1042 

X 

Kansas 

7 

Seward 

N37:20 

- 

W100:38 

1043 

X 

Kansas 

7 

Stanton 

N37:43 

- 

Wl.2;01 

1044 

X 

Kansas 

7 

Stevens 

N37:15 

- 

W101:06 

1045 

X 

Kansas 

8 

Edwards 

N37;45 

- 

W099;23 

1115 

X 

Kansas 

8 

Pawnee 

N38;07 

- 

W099;03 

1116 

X 

Kansas 

8 

Reno 

N38:09 

- 

W097:49 

1117 

X 

Ka *" as 

8 

Reno 

N37;54 

- 

W098:24 

1118 


Kansas 

8 

Stafford 

N38;02 

- 

W098;40 

1119 

X 

Kansas 

7 

Finney^ 

N38:04 

- 

W101;02 

1034 

X 

Kansas 

7 

Grant 

N37:43 

- 

W101:20 

1036 

X 

Kansas 

7 

Gray 

N37;48 

- 

W100;38 

1037 

X 

Kansas 

7 

Hamilton 

N38;03 

- 

W101;38 

1038 

X 

Kansas 

7 

Haskell 

N37:29 

- 

W101:00 

1065 

X 

Colorado 

9 

Baca 

N37:29 

— 

W102:48 

1012 



^Intensive Test Sites 
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TABLE 2-1 I.- CRD WHEAT ACREAGE ESTIMATES AND THEIR MEASURES OF ACCURACY 
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SECTION 3.0 


ERROR ANALYSIS - NOVEMBER/DECEMBER LACIE OPERATIONS 
3.1 LACIE DATA ANALYSIS 

Five intensive test site (ITS) segments located in Kansas 
were acquired and processed during the November-December 1974 
Large Area Crop Inventory Experiment (LACIE) operation. For 
three of these segments, number 1034 in Finney, number 1042 
in Morton, and number 1106 in Ellis Counties, multispectral 
scanner (MSS) data processing was completed for four bio- 
logical phases. For the remaining two segments, nximber 1111 
in Rice and number 1114 in Saline Counties, data processing 
was done for two biological phases. Two photointerpretations 
of each segment for its available biophases were completed by 
two randomly selected Analyst Interpreters (AI's). As a re- 
sult, two sets of training data were used to obtain two dif- 
ferent classification runs for each biophase of a segment. 

For the classification and acreage performance evaluation, 
approximately 20 test fields, 10 wheat and 10 nonwheat fields, 
were randomly selected from the ITS area inside a segment, 
and the correctly classified pixels for each of these fields 
were determined under the Classification and Mensuration 
Subsystem (CAMS) operation. Also computed were percentages 
of wheat within an ITS area for each classification run. 
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Thus, the wheat acreage performance was evaluated at two 
levels: field size and ITS area size. 

3.1.1 Method 

The analysis of covariance technique is employed to analyze 
the classification and acreage performance data. This tech- 
nique determines the contribution by each factor in the vari- 
ance observed in the response measurements, in the presence 
of different covariable values. Considering a linear relation- 
ship between the response variable (e.g., LACIE-computed wheat 
acreage) and the covariable (e.g., actual wheat acreage), 
it regresses the former on the latter. It accounts for the 
variation in the response variable caused by the variation in 
the covariable. It computes the residual variation, total 
variation less that due to the covariable, and further provides 
a breakdown by various factors and any interactions between 
them, including that due to natural variation, called the 
error component. 

3.1.2 Field Data 

Three intensive test site segments (1034, Finney; 1106, Ellis; 
1042, Morton) which were observed for all four biophases are 
considered for data analysis. Separate data analyses are made 
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for wheat and nonwheat fields performance data. Pairing the 
LACIE-computed acreage (response variable) # wheat or nonwheat 
as the case may be, with the actual acreage (covariable) for 
each selected field, the variance observed in the LACIE- 
computed acreages was analyzed in terms of variations caused 
by the field size, segment location, biophase, and photo- 
interpretation. Details on the underlying design model, 
numerical values of main coefficients in the model, and the 
analysis of variance tables are given in Appendix A. 

3. 1.2.1 Wheat error variance .- In the case of wheat , the total 
variance depicted in the data is 2848.66, and the residual 
variance after accounting for the field size variation is 
1307.93. So the variation in the field size accounts for 
54 percent of the total variance: 

2846.66 - 1307.93 .. 

28406 ’’ 

The error variance evaluated after accounting for the factors 
of segment location, biophase, photointerpretation, and factor 
interactions is 730.79, implying that all these factors and 
their interactions account for 44 percent of the residual 
variance : 


1307.93 - 730.79 ^ 
1307.93 


44 
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or, 20 percent of the total variance: 


1307,93 “ 730,79 
2848,66 


X 100 


20 


This leaves 56 percent of the residual variance or 26 percent 
of the total variance unaccounted for. 


In the absence of knowledge about any other factor influenc- 
ing the performance, the leftover portion provides an estimate 
of natural/inherent variation in the system. Thus, in the 
present case, the system error at the field level is estimated 
to be 27 (i.e., \^30 . 788 ) pixels, or roughly 30 acres in terms 
of standard deviation. By no means can this be regarded low, 
and, hence, the system needs improvement for it to be con- 
sidered effective and successful. 

As for the significance of different factors, the classifica- 
tion and acreage performance is significantly different from 
one biophase to another, but no such evidence exists in the 
case of segments. However, none of the biophases is uniformly 
better or worse when all segments are considered together. 

This implies that the system performance depends simultaneously 
upon the segment location and the biophase. This could be ex- 
pected because the effect of these two factors is dependent 
upon the types of competing crops and their state of growth 
in a segment location. And so, if either the competing crops 


3-4 



or their growth differs from segment to segment, the system 
performance can be expected to behave as shown presently. 

Next, there is a strong evidence of significant variation in 
the system performance due to different photointerpretations 
of a segment. This means a significant difference exists in 
training the classifier for automatic data processing class- 
ification and can be due to several reasons. 

A. One AI photointerprets the fields better than the other. 

B. Equal performance of AI*s in terms of correct identifies- 

Y tion, but different sets of training fields are selected. 

C. Different types of bias in selecting training fields by 
AI’s; e.g., one AI might be inclined to select a larger por- 
tion of a field than the other, or they might pick up a dif- 
ferent number of training fields for various crops in a 
segment. 

However, this is an indication that the inherent overlap in 
the MSS data of different crops is not too large to remain 
unaffected by AI performance; and given a better set of train- 
ing fields, classification performance could be improved con- 
siderably. Refer to the technical report, Chhikara (March 
1975) , "Effect of Analyst Misinterpretation of Training Data 
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on Classification Performance,** LEC-4353, which throws light 
on such theoretical aspects of this problem. 


* 


3.1. 2. 2 Non-wheat error variance .- In the case of nonwheat , 
the total variance obtained is 2914.60, and the residual 
variance is 581.12. Since 


2914.60 - 581.12 
2914.60 


X 100 


80 


the variation in field size accounts for 80 percent of the 
total variance. Next, the error variance is 463.95, implying 
all other factors and their interactions account for 20 per- 
cent of the residual variance: 


581.12 - 463.95 ^ 
581.12 


20 


or, 4 percent of the total variance: 


581.12 - 463.95 ^ 
2931760 


4 


This means that 80 percent of the residual variance or 16 per- 
cent of the total variance remains unaccounted for. Once 
again, this percentage needs to be reduced for the system to 
have an acceptable performance. 

In this case, segment location, biophase, and interaction 
between the two are all significant factors, which means that 
the system performance is very much dependent upon what 
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segment is processed and for what blophase. However, the 
factor of AI is not significant. This could be due to merg- 
ing all competing crops into one category - nonwheat. 

3.1.3 ITS Wheat Percentage Data 

The actual wheat percentage and the LACIE-computed wheat per- 
centage for each ITS area in a segment processed for different 
biophases are given in table A-IV of appendix A. Altogether 
there are 30 response measurements, and these are not arranged 
in a design that can support an extensive statistical analysis. 
So any data analysis for this set of measurements will be 
limited in scope. Often, a much larger data set is needed for 
a proper and valid statistical inference, particularly when 
the system generating the data could cause large variation. 

As such, considering only the biophase factor and the actual 
wheat percentage (i.e., the covariable, which also represents 
the segment location factor in the model) , an analysis of 
variance is presented in table A-V. The coefficient values 
for the terms in the model are presented in table A-VI of 
Appendix A. The total variance in data is 276.74, and 21 per- 
cent of it is accountable by the variation in actual wheat 
proportion in an ITS. The residual variance is 217.78, and 
most of it remains unaccounted for, implying that the factor 
for biophase is not significant compared to the inherent system 
variability (see fig. 3-1) . In terms of standard deviation, 
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LACIE computed wheat percent 





the latter is estimated to be 14.8 percent. Besides having 
this large a variation, there is an indication of an overall 
bias in estimating the wheat acreage. As pointed out earlier, 
this sort of conclusion is very tentative, and a larger set of 
experimental data is needed for a definitive study. 
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SECTION 4.0 


ESTIMATION OF VARIANCE COMPONENTS OF LACIE ACREAGE 
ESTIMATES - NOVEMBER/DECEMBER 1974 OPERATION 

4.1 LACIE DATA ANALYSIS 

The LACIE acreage estimate of a crop reporting district or 
state is subject to both the classification errors and the 
sampling errors. So, the variance of a LACIE estimate con- 
sists of two components: (1) variance due to classification 

errors, and (2) variance due to seunpling error. In order to 
analyze the LACIE system performance, it is desirable to 
separate these two variance components and to evaluate their 
impact individually on the reliability of an acreage estimate. 

During the LACIE-IA phase (November /December 1974 operation) 
segments acquired and processed were from Group I counties 
only. Acreage estimates were made for 28 counties, each con- 
taining a segment for which the wheat proportion was calcu- 
lated, and 6 CRD's. In assessing these estimates for accuracy 
and reliability, their variances were estimated in section 1. 
Since these were all Group I counties, the variance estimates 
were obtained using within county variance, which was as- 
sumed to be the same for all the counties in the six CRD's. 

As such, the problem of variance decomposition reduces to 
obtaining a break down of the LACIE system variance into the 
two components mentioned in the previous paragraph. 
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4.2 ESTIMATION OF THE SAMPLING ERROR COMPONENT 
I;i a conoRunlcation (ref. 7), Fred Warren (affiliated with the 
Statistical Reporting Service, USDA) reported an estimate 
of 0.011983 for the within county variance for the SRS segments 
(average size - 823.4 acres) in Kansas. This was obtained 
using actual wheat proportions of 460 SRS segments distributed 
over 97 counties in 9 CRO's. Since the ZACIE segment is 
larger in area (25426.5 acres), a smaller within county vari- 
ance is expected because the actual wheat proportion for the 
liACIE segment will be less variable than that for the SRS 
segment in the county. Thus, the figure of 0.011983 is an 
upper bound for the within county variance for the LACIE seg- 
ment. On the average, there are 30 SRS segments having the 
same area as 1 LACIE segment. If there is no correlation 
among wheat proportions of 30 SRS segments mailing up each LACIE 
segment, then selection of 1 LACIE segment in a county is equi- 
valent to a random selection of 30 SRS segments in the county. 
In this case, ignoring classification errors, an estimate of 
within county variance for the LACIE segment is 
0.011983/30 - 0.000394. Since a certain amount of intra- 
segment correlation (i.e. , the correlation among SRS segment 
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wheat proportions in a LACIE segment) is expected, this esti- 
mate of 0.000394 is, in fact, a lower bound for the within 
county variance for the LACIE segment. Thus, an unbiased 
estimate of the within county variance for the LACIE segment, 
which is the sampling error component since no classification 
errors are considered here, is expected to fall in between 
0.000399 and 0.011983. 

If the intra-segment correlation coefficient r is determined 
with respect to SRS segments, then an estimate of the sampl- 
ing error variance component for the LACIE segment can be 
obtained as (e.g., see ref. 5, p. 242) 

' (i - S) ^ - V C '“-i* 

where, in the case of Group I countie-' - 

n = 1, the number of LACI} segments selected in a county. 

N = 21.5, the average number of LACIE segments in a 
county . 

M = 30, the average number of SRS segments having the 
same area as one LACIE segment. 

2 

S^ = 0.011983, within county variance for the SRS seg- 
ment as obtained in reference 7. 
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To determine a value of r, one needs to know the actual wheat 
proportions for the SRS segments clustered in different LACIE 
segments in a county. In general, this information is not 
available. However, an estimate of r is obtained in the 
following way. 

Each of the five intensive test sites (ITS's) in Kansas was 
partitioned into 1 -square-mile areas, and the wheat proportion 
for each of these areas was determined. For these wheat pro- 
portion data, an analysis of variance is presented in 
table 4-1. Based on this analysis, an estimate of r is ob- 
tained equal to 0.15. (See Appendix B for details.) 

Now, making substitutions in equation (4-1) , one gets 

[^ ■ 2 I 3 ] ^ ^30 ~ (0-011983) = 0.00204 (4-2) 

Hence, the sampling error component (i.e. , within county vari- 
ance when no classification errors are made) for the LACIE 
segment is estimated to be 0.00204. 

This estimate is derived on the basis of wheat proportion data 
from five ITS's, of which two were 5- by 6-square-mile areas 
and three were 3- by 3-square-mile areas. From the reliability 
viewpoint, this is a small number of samples. For example, 
the estimate of 0.00204 for the within county variance seems 
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TABLE 4-1.- ANALYSIS OF VARIANCE - WHEAT PROPORTION PER 
1-SQUARE-MILE AREA (SRS SEGMENT) FOR FIVE INTENSIVE 
TEST SITES IN KANSAS 


Source of variation 

Degrees 

of 

freedom 

Sum of squares 

Mean squares 

Total 

86 

3.242011 

0.037698 

LACIE segment 

4 

.930369 

.232592 

SRS segment 

82 

2.311642 

.028191 
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large, and could be due to employing only five segments, three 
only partially available, located in five different counties 
in the state. Had there been more segments available, it 
could be that a smaller and better estimate of the variance 
would have resulted. 

4 . 3 VARIANCE COMPONENTS 

Utilizing the CAMS computed wheat proportions for the 28 seg- 
ments processed during the November/December 1974 operation, 
an estimate of LACIE system variance was obtained of 0.0093. 
(See ref. 4 for details on its computation.) Since this is 
an estimate of the sum of two variance components, one due to 
classification errors and the other due to sampling error, 
and the sampling error variance component is estimated to be 
0.00204 in the previous section, an estimate of the classifica- 
tion error variance component is 0.0093 - 0.00204 = 0.00726. 
This is almost 3.6 (i.e., 0.00726/0.00204 times, as big as the 
sampling error component of the total error) . 

Thus, the classification component of the variability in esti- 
mating the wheat proportion in a county dominates heavily the 
sampling error component, even when only one segment per 
county is selected--. It is conjectured that this trend in 
their relative magnitudes will continue to increase when more 
and more segments are aggregated for a crop acreage estimate. 
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If this is true, it implies that the classification component 
of the variance in acreage estimation will become more and more 
critical compared to the sampling error component as the level 
of aggregation goes up from the CRD level. Therefore, if the 
LACIE 90/90 criteria is not met at the desired aggregation 
level under the current sample segment allocation, additional 
sampling of segments may help in achieving it (the 90/90 
criteria) . This assumes that the ADP classification component 
of the variance does not show any significant increase relative 
to the sampling error component, which will be reduced by 
sampling of more segments. 
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APPENDIX A 


Model 

The field data structure generated by CAMS operation is appro- 
priately described by the model : 


i = 1, 2, 3 (segment number) 
j » 1, 2, 3, 4 (biophase number) 
k =» 1/ 2 (AI number) 
i = 1, 2, N^j (field number) 

where 


LACIE- computed acreage corresponding to the Ith 
test field of the ith segment for kth photoin- 
terpretation of jth biophase. 

Actual size of Ith test field of ith segment for 
kth photointerpretation of jth biophase. 

Overall bias at X = 0 . 

= ith segment effect on acreage performance. 

B. = jth biophase effect on acreage performance. 


Y. . 
ijkil 


^ijkA 


A = 
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(SB)^j * Effect of interaction between ith segment 
and jth biophase on acreage performance. 

^k(i) ** photointerpretation effect on acreage 
performance in ith segment. 


= Effect of interaction between jth biophase 
and kth photointerpretation on acreage per- 


formance in ith segment. 


^ijkJl ” Er^ror term associated with 


The basis for this model is a cross classification between seg- 
ments and biophases but a nested classification for AI's in 
segments. The analysis of variance obtained under this model 
for wheat and nonwheat field data is shown in tables A-I and 
A-II. Numerical values of coefficients of different terms in 
the model are given in table A-III. 


Since the field size varies and may cause a larger (or smaller) 
residual r corresponding to a large (or sma)!) field, the model 
needs to be checked for its adequacy. Except for a slight 
trend, residuals were found well scattered and random. To 
investigate it further, we analyzed the field data somewhat 
differently: we computed p , the proportion of correctly 

classified pixels per test field; applied a variance stabili- 
zing transformation, 
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Y » Arc sin ^ 


and performed the analysis of variance for Y's. Once again 
our conclusions regarding the significance of different fac 
tors in each case, wheat or nonwheat, were the same as men- 
tioned previously. 
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ANALYSIS OF VARIANCE - WHEAT FIELD DATA 















TABLE A- I I.- ANALYSIS OF VARIANCE - NONWHEAT FIELD DATA 







TABLE A-III.- MAIN COEFFICIENTS FOR THE MODEL 


Term in 

Coefficient value 

model 

Wheat 

Nonwheat 

S 

0.646 

0.924 

A 

0,54 

-6.5 


10.5 

3.0 


-15.4 

14.1 

®3 

4.9 

-17.1 


-21.9 

6.7 


10.7 

-0.9 


3.9 

-7.1 

®4 

7.3 

1.3 
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TABLE A- IV.- ITS WHEAT PERCENTAGE DATA 




Wheat percentage I 

ITS segment 

Biophase 








Actual 


1034 (Finney) 

I 

23 

17.49 

13.39 


II 


43.90 

27.14 


III 


24.48 

21.00 


IV 


37.12 

22.98 

1042 (Norton) 

I 

40 

37.33 

40.52 


II 


41.46 

26.20 


IV 


31.37 

22.00 

1106 (Ellis) 

I 

45 

39.37 

59.06 


II 


68.78 

50.96 


III 


60.77 

73.09 


IV 


39.97 

72.15 

1111 (Rice) 

III 

34 

17.55 

28.89 


IV 


49.37 

65.07 

1114 (Saline) 

II 

59 

44.37 

40.87 


IV 


53.53 

39.42 


'^Two LACIE estimates corresponding to each bio- 
phase due to two photointerpretations. 
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TABLE A-V.- ANALYSIS OF VARIANCE - ITS WHEAT PERCENTAGES 


F-Ratio 

0.18 

H4 
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TABLE A- VI.- COEFFICIENTS IN THE MODEL® 


Term in model 

Coefficient value 

e 

0.74 

A 

11.79 


-2.48 

^2 

1.64 

®3 

-1.34 


2.18 


‘Model: = A + Bj + 8X, + e, 

i s If 2 , 3/ 4, 5 (segment number) 
j = 1,2,3 f 4 (biophase number) 
k = 1 f 2 (photointerpretation number) 


where 


Y. .. = LACIE-computed wheat percentage for ITS area in 
Lth segment with inputs from jt)i biophase and 
its Vith photointerpretation. 

= Actual wheat percentage for ITS area in ith segment. 

A = Overall bias at X = 0 . 

= Effect of ^th biophase on wheat estimation. 

e. = Error term associated with Y. .. . 
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APPENDIX B 


INTRA-CLUSTER CORRELATION COEFFICIENT 

Formula 

Let Y . be the measurement for the jth individual of the ith 
ij — 

cluster, j = 1, 2, M and i = 1, 2, N, and 

Y. = ly. ./M, the ith cluster average, and Y = EZy. ./NM, the 
1 j 13 ij 

overall average. Then, by definition, the intra-cluster cor- 
relation is 


r 


ZZZ (y . . 
ijk 
j?lJc 


Yi> (y.j^ - Y.)/NM(M - 1) 

r”7 

ZZ(y. . - Y) /MN 
ij 


This simplifies to (e.g., see ref. 6, p. 110) 


r 


2 

M a ^ - 
b 


(M - 1) 


2 

a 


“7 

a 


where 

2 

a = Total (between cluster plus within cluster) variance. 
2 

= Between cluster variance component. 


M = Number of individuals per cluster. 



Calculations 


From table 3-1, estimates corresponding to a and o. are 

b 

given by 


= 0.037698 


and 


S 


b 


2 


0.232592 - 0.028191 
30 


0.006813 


respectively. So, in the present case, 

30(0.006813) - 0.037698 _ « 

^ ^ 29(0.037698) 

2 

It may be mentioned that S = 0.037698 is also an estimate of 

the within county variance for the SRS segment and is certainly 

higher than expected. This is partly due to the fact that the 

ITS segments were located in different counties, and so it also 

includes the between county variance, as well. But then, so 
2 

does Sj^ . When r is computed as above, it is hoped that the 
effect of the between county variance is minimized. 
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